Automated extraction of citation data in a large digital library
Abstract
The RePEc academic documentation project for Economics contains the largest distributed digital library in the world. It indexes over 35,000 freely accessible electronic papers in the field of academic Economics. This paper is an interim report on our ongoing efforts to develop a citation linking system for this project. We discuss both technical issues in the construction of the system and conceptual issues in its usage.
Similar resources
Citation analysis of graduate Dental thesis references: Before and after an intervention
Background: The introduction of the Iranian National Medical Digital Library (INLM) was a major investment over the past several years. The aim of this study was to assess the effectiveness of this scientific intervention by examining citation patterns in graduate dental theses before and after INLM became accessible. Methods: This analytical study was conducted among all graduate dental ...
Management of XML Documents in an Integrated Digital Library
We describe a generalized toolset developed by the Perseus Project to manage XML documents in the context of a large, heterogeneous digital library. The system manages multiple DTDs through mappings from elements in the DTD to abstract document structures. The abstraction of document metadata, both structural and descriptive, facilitates the development of application-level tools for knowledge ...
Axes of Development of Digital Libraries
Purpose: This paper qualitatively presents the issues related to the axes of development in digital libraries, including human resources, content, services, and technology, and offers a clear viewpoint on them by considering all existing aspects. Methodology: In this paper, all available resources were used. Using the citation (library) method, the related literature was studied and, besi...
Scholarly big data information extraction and integration in the CiteSeerχ digital library
CiteSeer is a digital library that contains approximately 3.5 million scholarly documents and receives between 2 and 4 million requests per day. In addition to making documents available via a public Website, the data is also used to facilitate research in areas like citation analysis, co-author network analysis, scalability evaluation and information extraction. The papers in CiteSeer are gath...
A Web Service for Scholarly Big Data Information Extraction - Williams-CiteSeerExtractor-ICWS14
The automatic extraction of metadata and other information from scholarly documents is a common task in academic digital libraries, search engines, and document management systems to allow for the management and categorization of documents and for search to take place. A Web-accessible API can simplify this extraction by providing a single point of operation for extraction that can be incorpora...